A Novel Approach of Calculating Information Entropy in Information Extraction
نویسندگان
چکیده
منابع مشابه
A Novel Approach of Calculating Information Entropy in Information Extraction
Noise data of web page is easy to cause the topic drift problem in web information extraction. To improve the accuracy of web information extraction effectively, a novel calculation method of mixing entropy is presented, which can more accurately reflect the topic information of web page. The information block is discussed under the multi-page site environment. The impacts of information within...
متن کاملA Mutual Information Approach to Calculating Nonlinearity
A new method to measure nonlinear dependence between two variables is described using mutual information to analyze the separate linear and nonlinear components of dependence. This technique, which gives an exact value for the proportion of linear dependence, is then compared with another common test for linearity, the Brock, Dechert and Scheinkman (BDS) test. Copyright c © 0000 John Wiley & So...
متن کاملInformation Extraction Information Extraction
In this paper we present a new approach to extract relevant information by knowledge graphs from natural language text. We give a multiple level model based on knowledge graphs for describing template information, and investigate the concept of partial structural parsing. Moreover, we point out that expansion of concepts plays an important role in thinking, so we study the expansion of knowledg...
متن کاملentropy, negentropy, and information
evaluation criteria for different versions of the same database the concept of information, during its development, is connected to the concept of entropy created by the 19th century termodynamics scholars. information means, in this view point, order or negentropy. on the other hand, entropy is connected to the concepts as chaos and noise, which cause, in turn, disorder. in the present paper, ...
متن کاملA New Approach for Web Information Extraction
With the exponentially growing amount of information available on the Internet, an effective technique for users to discern the useful information from the unnecessary information is urgently required. Cleaning web pages for web data extraction becomes critical for improving performance of information retrieval and information extraction. So, we investigate to remove various noise patterns in W...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Database Theory and Application
سال: 2013
ISSN: 2005-4270,2005-4270
DOI: 10.14257/ijdta.2013.6.5.05